Optimising chemical named entity recognition with pre-processing analytics, knowledge-rich features and heuristics
نویسندگان
چکیده
منابع مشابه
Optimising chemical named entity recognition with pre-processing analytics, knowledge-rich features and heuristics
BACKGROUND The development of robust methods for chemical named entity recognition, a challenging natural language processing task, was previously hindered by the lack of publicly available, large-scale, gold standard corpora. The recent public release of a large chemical entity-annotated corpus as a resource for the CHEMDNER track of the Fourth BioCreative Challenge Evaluation (BioCreative IV)...
متن کاملMongolian Named Entity Recognition System with Rich Features
In this paper, we first build a manually annotated named entity corpus of Mongolian. Then, we propose three morphological processing methods and study comprehensive features, including syllable features, lexical features, context features, morphological features and semantic features in Mongolian named entity recognition. Moreover, we also evaluate the influence of word cluster features on the ...
متن کاملCyrillic Mongolian Named Entity Recognition with Rich Features
In this paper, we first create a Cyrillic Mongolian named entity manually annotated corpus. The annotation types contain person names, location names, organization names and other proper names. Then, we use Condition Random Field as classifier and design few categories features of Mongolian, including orthographic feature, morphological feature, gazetteer feature, syllable feature, word cluster...
متن کاملNamed entity recognition: Exploring features
We study a comprehensive set of features used in supervised named entity recognition. We explore various combinations of features and compare their impact on recognition performance. We build a conditional random field based system that achieves 91.02% F1-measure on the CoNLL 2003 (Sang and Meulder, 2003) dataset and 81.4% F1-measure on the OntoNotes version 4 (Hovy et al., 2006) CNN dataset, w...
متن کاملImproving named entity recognition with prosodic features
In natural language processing (NLP) the problem of named entity (NE) recognition in speech is well known, yet remains a challenge where performance is dependent on automatic speech recognition (ASR) system error rates. NEs are often foreign or out-of-vocabulary (OOV) words, leaving conventional ASR systems unable to recognize them. In our research, we improve a CRF-based NE recognition system ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Cheminformatics
سال: 2015
ISSN: 1758-2946
DOI: 10.1186/1758-2946-7-s1-s6